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Abstract: Legumes, as protein-rich crops, are widely used for human food, animal feed and vegetable oil production. 
Over the past decade, two legume species, Medicago truncatula and Lotus japonicus, have been adopted as model leg- 
umes for genomics and physiological studies. The tobacco transposable element, Tntl, is a powerful tool for insertional 
mutagenesis and gene inactivation in plants. A large collection of Tntl -tagged lines of M. truncatula cv. Jemalong was 
generated during the course of the project 'GLIP': Grain Legumes Integrated Project, funded by the European Union 
(www.eugrainlegumes.org). In the project TFCOSMO': Integrated Functional and Comparative genomics Studies on the 
MOdel Legumes Medicago truncatula and Lotus japonicus, supported by a grant from the Ministry of Education, Youth 
and Science, Bulgaria, these lines are used for development of functional genomics platform of legumes in Bulgaria. This 
review presents recent advances in the evaluation of the M. truncatula Tntl mutant collection and outlines the steps that 
are taken in using the Tntl -tagging for generation of a mutant collection of the second model legume L. japonicus. Both 
collections will provide a number of legume-specific mutants and serve as a resource for functional and comparative ge- 
nomics research on legumes. Genomics technologies are expected to advance genetics and breeding of important legume 
crops (pea, faba bean, alfalfa and clover) in Bulgaria and worldwide. 
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1. PLANT GENOMIC AND POST-GENOMIC ERA 
WORLDWIDE AND IN BULGARIA 

Plant genomics research started with a complete genome 
sequencing of the model plant Arabidopsis thaliana in De- 
cember 2000 [1]. The Arabidopsis community proposed an 
ambitious goal to identify the function of every gene by 
2010. Since then, an extremely rapid progress has been made 
in the field of plant genomics. An important step ahead was 
the publication of the rice genome sequence in 2002 [2, 3], 
draft genome of poplar in 2006 [4], whole genome sequence 
of two grapevine genotypes in 2007 [5], draft sequence of 
the genomes of Lotus japonicus [6, 7] and Glycine max in 
2010 [8]. Currently, the genomes of several other plant spe- 
cies as barley, wheat, potato, cotton, tomato, maize, Medi- 
cago truncatula have also been extensively explored. The 
rapid accumulation of sequencing information and global 
gene expression analyses enable us to generate hypotheses 
about gene function and answer specific biological ques- 
tions. A great challenge for the scientists of the post-genomic 
era is to analyze and understand function of every gene for 
all the sequenced genomes. 
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Legumes are protein-rich crop plants, used for human 
nutrition, animal feed and production of vegetable oil. A key 
contribution of legumes to sustainable agriculture and nitro- 
gen cycle comes from their ability to fix atmospheric nitro- 
gen in most agricultural ecosystems. They are able to form 
specialized symbiotic organs, the root nodules, in which 
rhizobial bacteria reside and reduce atmospheric nitrogen to 
ammonia. This process largely contributes to the nitrogen 
nutrition of the host plant, reducing the need for nitrogen 
fertilizers. Intensive application of chemical fertilizers has 
allowed developing and maintaining a very productive agri- 
culture. However, this application has resulted in nitrogen 
leakage from agricultural systems into groundwater, rivers, 
coastal waters. 

Over the past decade, two model legumes, M. truncatula 
[9, 10,] and L. japonicus [11-13] have been proposed for 
molecular genetics research. It is generally accepted that 
knowledge about certain shared characteristics of legumes, 
such as the pathways involved in symbiosis with rhizobia 
and synthesis of flavonoids and glycosides, is easily trans- 
ferable from model plants to crops [14]. Thus, information 
from these two models would be useful for improvement of 
forage legumes, e.g. alfalfa and clovers, even for the study of 
agronomic traits (yield and growth habits), because of the 
close relation of L. japonicus and M. truncatula to the forage 
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legume species Lotus corniculatus (birdsfoot trefoil) and 
Medicago sativa (alfalfa), respectively. Model legumes can 
also provide useful information and contribute to the im- 
provement of other crops, such as tomato, sunflower, cotton, 
corn and rice. In addition, the nitrogen fixation process in 
legumes has a unique biochemistry [15] and many valuable 
molecules with an important biomedical application, such as 
isoflavones and plant sterols, are found among legume me- 
tabolites [16]. 

The 'Medicago truncatula" Sequencing Consortium is an 
international partnership of research laboratories, which is 
decoding the genome sequence of M. truncatula. It was 
founded by the Samuel Robert Noble Foundation and Na- 
tional Science Foundation in the USA, and by the European 
Union. The European Grain Legumes Integrated Project 
(GLIP) was launched in 2004 to increase knowledge in grain 
legume biology in order to develop new strategies for pro- 
moting the cultivation of legumes in Europe, as well as their 
use in animal feed. Legumes are a key component for the 
development of sustainable agriculture, but they remain 
underused in Europe. The "Medicago truncatula' Sequenc- 
ing Consortium has made significant contribution to the de- 
velopment of an expressed sequence tag (EST) resourse and 
genome sequencing. A collection of 268 712 ESTs that rep- 
resents 38 238 unigenes and >89 Mb genomic sequences of 
the 200 Mb euchromatic regions have been made publicly 
available. Sequencing of the gene space of the eight Medi- 
cago chromosomes is almost completed (http://www.medi- 
cago.org). 

In the frame of the project TFCOSMO': Integrated Func- 
tional and Comparative genomics Studies on the MOdel 
Legumes Medicago truncatula and Lotus japonicus (sup- 
ported by a grant from the Ministry of Education, Youth and 
Science, Bulgaria), we investigate the utility and challenges 
of exploring forward and reverse genetic tools using the Tntl 
insertional mutant collection of M. truncatula created by 
AgroBioInstitute (ABI) during the GLIP project. In order to 
generate a mutant collection of the second model legume L. 
japonicus, particular attention is given to the use of the Tntl- 
mediated mutagenesis for establishment of starter lines for 
this model legume. Both collections will provide a number 
of legume-specific mutants and will serve as a base for ad- 
vanced functional and comparative genomics research on the 
legumes. 

2. TNT1 AS A POWERFUL TOOL FOR INSER- 
TIONAL MUTAGENESIS IN MEDICAGO TRUNCA- 
TULA AND MAYBE IN LOTUS JAPONICUS 

Generation of large mutant collections and subsequent 
analysis of the effect of gene tagging and mutation represent 
a fundamental approach for understanding gene function and 
genetic interactions. Different tools are used for gene disrup- 
tion and production of mutant collections. Establishment of 
several technological platforms, such as TILLING [17], mi- 
croarrays of several thousand non-redundant Lotus cDNAs 
for transcriptomic and proteomic profiling [18,19], and ad- 
vanced genome-sequencing programs [20] has made a major 
contribution to the functional genomics studies on the model 
legumes. A transposon-tagged L. japonicus mutant, nin (for 
nodule inception), has been isolated and characterized by 



Schauser et al. [21]. Imaizumi et al. [22] demonstrated that 
activation-tagging method can be successfully used in the 
model legume L. japonicus. Insertional mutagenesis via 
transposable elements [23, 24] represents attractive alterna- 
tive for creation of large mutant collections. Class I mobile 
genetic elements, or retrotransposons, copy themselves and 
paste copies back into the genome at multiple places [25]. 
Initially retrotransposons copy themselves to RNA but, in 
addition to being transcribed, the RNA is copied into DNA 
by a reverse transcriptase [26] and is inserted back into the 
genome. Because there is no excision during replicative 
transposition, mutations generated by retrotransposon inser- 
tions are stable. Efficient transposition of the Nicotiana ta- 
bacum Tntl retrotransposon in M. truncatula R108 line is 
firstly described by d'Erfurth et al. [27] during the early 
steps of in vitro transformation/regeneration process and 
confirmed for M truncatula cv. Jemalong 2HA [28]. The 
high efficiency of Tntl transposition during the regeneration 
process results in lines, carrying multiple Tntl inserts (from 
4 to 20 insertions per regenerated plant) and most likely, 
resulting in the accumulation of multiple mutations. The 
Tntl insertions are stable during the life cycle of M. trunca- 
tula and most of them are genetically independent and can be 
separated by recombination. In addition, Tntl seems to 
transpose preferentially into the gene rich regions of the M. 
truncatula genome [29, 30] and its multiplication by trans- 
position can be re-induced by tissue culture. Several of the 
generated developmental and symbiotic Tntl -tagged mutants 
have been already identified and characterised [31-33]. 

Based upon successful activity of the worldwide Medi- 
cago consortium, we propose an idea for initiation and de- 
velopment of a Tntl mutant collection for the second model 
legume L. japonicus. This collection will provide a number 
of legume-specific mutants and will serve as a resource for 
advanced functional genomics research of L. japonicus. In 
addition, the prospect of integrating genome information 
from both model legumes to achieve a platform for compara- 
tive genomics will provide novel insights into the organiza- 
tion and evolution of legumes, as well as the similarities and 
differences with genomes of other plant families. 

3. FUNCTIONAL LEGUME GENOMICS PLATFORM 
IN BULGARIA ON THE MOVE 

Recent developments in the area of legume genomics 
provide exciting opportunities for transferring knowledge 
from models into crop legumes. To date, most progress in 
identifying genes important for many aspects of legume bi- 
ology has been made in the two model legumes, M. trunca- 
tula and L. japonicus. In Bulgaria, M. truncatula research 
started with participation of ABI in the project 'Molecular 
and cellular bases of somatic embryogenesis in alfalfa', 
INCO-COPERNICUS (IC15-CT96-0906), FP5 of European 
Union. During the course of this study an efficient regenera- 
tion and transformation procedures for M. truncatula are 
established. Because of the successful project implementa- 
tion, ABI was invited as a partner to the GLIP project (FP6, 
FOOD-CT-2004-506223). The main research task of ABI 
together with the other partners in work package (WP) 5.2 
"Tntl mutagenesis" was to develop a large-scale Tntl mutant 
collection of M. truncatula. The team contributed greatly in 
defining conditions for efficient Tntl transposition, resulting 
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in lines with a high number of inserts [28]. Seeds of the M. 
truncatula mutant lines are currently available at ABI, as 
well as at the stock centers for seed production in Szeged, 
Hungary and Dijon, France. Two of the partners of ABI in 
the WP 5.2 were the Institut des Sciences du Vegetal, Centre 
national de la recherche scientifique (ISV, CNRS), repre- 
sented by Dr. Pascal Ratet (the leader of WP 5.2), and the 
Department of Plant Systems Biology, VIB/University of 
Ghent represented by Prof. Sofie Goormachtig. Because of 
the common interest in further exploitation of the mutant 
lines generated during GLIP and common research activities, 
these two institutions have become partners in the IF- 
COSMO project. Another Bulgarian project partner is the 
Institute of Plant Physiology and Genetics (IPPG), Bulgarian 
Academy of Sciences, which provides expertise in Gateway 
cloning, hydroponic cultivation, and a number of imaging 
and biochemical techniques. Since 2002, L. japonicus has 
been used as a model plant for investigation of symbiotic 
relationships with Mesorhizobium loti or purified M. loti 
lipochitin oligosaccharide signal molecules (Nod factors). 
Visualization and quantification of cortical microtubules 
dynamics in living root hairs of L. japonicus have been 
achieved [34, 35]. 

4. RECENT PROGRESS IN IFCOSMO PROJECT 

The M. truncatula tagged population is a very useful re- 
source for investigation of gene function in legumes. In order 
to confirm the presence of new copies of the Tntl retrotrans- 
poson, randomly selected lines from the Bulgarian Tntl mu- 
tant collection were screened by a transposon display (TD) 
technique or Inverse PCR (IPCR). Thirty of the positive lines 
from T[ generation were selected for assessment of their 
phenotype characteristics and symbiotic properties, accord- 
ing to the visible morphological criteria for mutant evalua- 
tion [29]. The selected Tntl mutants from Tj and the follow- 
ing T 2 generations were planted in a greenhouse and their 
root and shoot morphology, variations in plant growth, de- 
velopment, flowering and pigmentation were evaluated. Af- 
ter performing the greenhouse assessment, several pheno- 
typic classes could be distinguished: the first and largest 
class comprises the lines with extreme and moderate dwarf 
phenotype (Supplemental Fig. (1A)); the second largest class 
possesses inflorescence and floral organ defects, yellow- 
greenish and variegated leaf colour, mixed foliage leaves, 
early or late flowering (Supplemental Fig. (IB)). One of the 
Tntl lines showed quite unusual phenotypic features mani- 
fested by five- to seven-foliate leaves and defective flower 
development (Supplemental Fig. (1C)). Unfortunately, this 
line was sterile and did not produce any seeds. 

Root morphology and development, and symbiotic char- 
acteristics of the mutant lines were evaluated in vitro on agar 
plates and in vivo in hydroponic systems. In view of the root 
architecture, three main phenotypic groups were identified: 
mutant plants with long and thin roots and reduced lateral 
branches represent the first group (Supplemental Fig. (2A)), 
the second group includes lines with short roots having many 
secondary branches (Supplemental Fig. (2B)), and the third 
group was presented by the lines with short and unbranched 
root architecture (Supplemental Fig. (2C)). Evaluation of the 
root development in Ti and T 2 generations from seedling 
stage to maturity revealed mutant plants with very unusual 
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root phenotype. These plants produce many secondary 
branches that grow parallel to each other on the main root, 
resembling a fish bone. Based upon this root structure, the 
mutant was named 'fishbone' (Supplemental Fig. (2D)). Se- 
quencing of the Tntl insertion sites (FSTs) for this line is in 
progress. 

The selected Tntl lines revealed a wide variability in 
their symbiotic characteristics. They could be grouped into 
several distinct classes: the most numerous group demon- 
strated complete absence of nodulation (Nod"), the second 
group formed white spherical nodule-like structures (Nod + , 
Fix"), the third class was characterized by a mixture of white 
(inefficient) and efficient nodules (Nod + Fix~ /+ ), and very few 
lines formed an excessive number of nodules (Nod ++ ). There 
were also Tntl lines that could not be infected (Inf), and a 
line that formed elongated efficient nodules (Supplemental 
Fig. (3)). 

Part of the phenotyping activities on the selected Tntl 
mutants were focused on some aspects of leaf epidermal 
morphology [36]. The leaf epidermis is essential to plant 
functioning and survival not only because it is the first line 
of defense against environmental damages but also because 
of its crucial developmental role. Interactions between the 
epidermis and internal tissues regulate the overall leaf archi- 
tecture. Selected Tntl insertion lines showed significant 
variations in leaf epidermal cell size, shape and the number 
and distribution of stomata. There were also visible varia- 
tions in cell wall undulation, which was differently ex- 
pressed in the upper and lower epidermis. 

Pictures, microscope images and other data and informa- 
tion are collected in order to create an online catalogue of the 
investigated mutant lines. 

As mentioned above, the plant genomic region that bor- 
ders Tntl (Flanking Sequence Tag, FST) can be identified by 
TD or Inverse PCR. All the FSTs that are sequenced by vari- 
ous partners are stored in a database for public use at the 
Samuel Roberts Noble Foundation (http://bioinfo4.noble.org/ 
mutant/). On the base of these sequences, two mutant lines 
(So 6142A h So 5945 A) from the ABI's collection were se- 
lected because of the location of their FSTs in the coding 
sequence of three important genes. They correspond to inser- 
tions in a 20G-Fe (II) oxygenase (NP7260384, line 6142A, 
insertion 9), HAC1, transcription cofactor (NP7260991, line 
5945A insertion 5) and Cyclin-like F-box protein 
(NP7270344, line 5945A insertion 7). Two FSTs partially 
correspond to the genes encoding an Auxin influx carrier 
protein, LAX3 (NP6 11743) and Transcriptional factor B3 
family protein, similar to Auxin response factor 
(NP7272450). These five genes were selected for molecular 
cloning and characterization because of their involvement in 
fundamental processes of plant development, participation in 
secondary root branching, nitrogen fixation and somatic em- 
bryogenesis. The selected genes were cloned into overex- 
pression and knockout vectors by the GATEWAY technol- 
ogy. Transcriptional reporters consisting of promoter frag- 
ments of the genes of interest, driving GFP and GUS, were 
also constructed. Building tools for studying transgenic M. 
truncatula, L. japonicus and A. thaliana (used as a reference 
species) plants, where these genes are either overexpressed 
or silenced through RNA interference (RNAi) technology 
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[37] are currently in progress. In order to find homozygous 
'tag' mutant individuals among Fi progeny of two of the 
above mentioned lines (insert 9 of line So 6142, and insert 5 
and 7 of line So 5945), for the respective FSTs a reverse 
genetic approach will be used. 

Assessment of experimental data for the generation of 
starter lines and Tntl mutant collection for M. truncatula has 
shown the presence of the auxin 2.4-dichlorophenoxyacetic 
acid (2.4D) during callus tissue induction as an important 
component for successful transposition of the tobacco Tntl 
retroelement [28]. Besides the presence of 2.4D in culture 
media, other important components ensuring high transposi- 
tion efficiency of Tntl during regeneration of mutant lines 
are the induction of friable callus tissue to full dedifferentia- 
tion of plant explants, and the osmotic pretreatment of starter 
explants. Unfortunately, the efficiency of the existing in vitro 
techniques and procedures for regeneration and gene transfer 
in L. japonicus does not correspond to the requirement for 
effective Tntl introduction and transposition. The data on the 
regeneration and gene transfer for L. japonicus published so 
far, illustrated successful use of hypocotyl explant and plant 
growth regulators like a-naphthaleneacetic acid (NAA) and 
6-benzylaminopurine (BAP). Previously developed regen- 
eration protocols induce compact callus tissue, followed by 
shoot development. One of the main objectives of the IF- 
COSMO project, concerning construction of L. japonicus 
Tntl starter line and further generation of insertional mutant 
lines, is to develop an indirect somatic embryogenesis proto- 
col using 2.4D as the main callus inducer. During the first 
stage of the project, new regeneration/transformation proto- 
cols were established, which allowed obtaining starter lines, 
carrying the Tntl retrotansposon in a low copy number. The 
TD and IPCR analyses confirmed the presence of one to 
eight transposed copies of the element per line. The experi- 
mental data available for M. truncatula starter lines demon- 
strate that four to eight Tntl copies are required for the 
newly introduced insertions [28]. The second part of the 
IFCOSMO project will focus on further evaluation of the 
starter lines and setting up conditions for efficient Tntl 
transposition in L. japonicus. 

Recent progress in molecular biology, genomics, tran- 
scriptomics, proteomics, metabolomics and bioinformatics 
for rice, A. thaliana, M. truncatula and L. japonicus has pro- 
vided new opportunities for investigation of these model 
species [38-41]. The knowledge obtained by comparative 
genomics analyses of legumes and other model plants, such 
as Arabidopsis and rice, could allow the identification of 
legume specific systems and provide insights into biological 
phenomena of legume crops. Genomes of the model species 
M. truncatula and L. japonicus share considerable genetic 
synteny with crop legumes, which greatly facilitates gene 
discovery and understanding the complex relationships be- 
tween genes and phenotypes. To support comparative ge- 
nome analyses among legumes, up-to-date genetic and ge- 
nomic databases, integrating information from multiple leg- 
ume species have been developed (LIS, 
http://www.comparative-legumes.org/lis/ and LegumeDB, 
http ://ccg .murdoch. edu . au/index) . 

Successful translation of accumulated knowledge from 
models to practical agriculture would lead to new discoveries 



about developmental processes and application of new 
strategies for crop improvement. In a long-term plan, devel- 
opment of functional and comparative genomics platform 
would be of particular agricultural significance, facilitating 
the genetics and breeding of important legume crops culti- 
vated in Bulgaria and worldwide, such as pea, faba bean, 
alfalfa and clover. 
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